Structure and function in flow networks 
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This Letter presents a unified approach for the fundamental relationship between structure and 
function in flow networks by solving analytically the voltages in a resistor network, transforming 
the network structure to an effective all-to-all topology, and then measuring the resultant flows. 
Moreover, it defines a way to study the structural resilience of the graph and to detect possible 
communities. 
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The relation between structure and function is one of 
the most studied topics in the theory of complex net- 
works [11-01 ■ The structure is a topological representa- 
tion of the interacting elements forming a network and 
it is rigorously described by the theory of graphs. Be- 
haviour is a functional observable of the network and 
can be measured by a variety of different approaches and 
methodologies, e.g., it can be quantified as a statistical 
description of the weights of connections of the structure 
of the network [ll-Q or depend on the dynamics of inter- 
actions among the elements. 

This work deals with flow networks that satisfy con- 
servation laws (Kirchhoff's laws). The model for the flow 
network is stated in terms of resistor networks (weighted 
symmetric graphs), which have a source node s and a 
sink node t feeding the system, and a linear relationship 
between voltages and currents The problem models 
a wide variety of physical phenomena, such as river flow 
channels, transport networks, ion channels, and electrical 
circuits. Its solutions establish a clear relationship be- 
tween the topological structure of the networks (namely, 
adjacency matrix and edge weights, assumed known) and 
the functional flows passing through nodes and edges 
(that are a consequence of solving the flow model). The 
foundation of network flow theory roots back to Kirchhoff 
Q , answering what are the current flows in an electrical 
circuit when a set of voltages is applied. The solution is 
then achieved by solving Kirchhoff's equations [Mil. 



In this Letter, we present a unified approach for the 
fundamental relationship between structure and function 
in flow networks by calculating voltages (loads) when in- 
put currents (flows) are given. The main results of this 
approach are the following. 

First, a novel formula for the voltages in the flow 
network model, i.e., the Voltage-Flow (VF) problem 
[Eq. ^] , is analytically derived. The solution [Eq. ^] is 
expressed in terms of the known graph's weighted Lapla- 
cian matrix eigenspace base. In particular, we show that 
the behaviour of the flows is not only dependent on the 
matrix spectra, but intrinsically connected to the eigen- 



vectors of this matrix. 

Second, we show that the equivalent resistance (also 
known as two-point distance resistance) formula |13| is 
retrieved [Eq. ([7])] without further assumptions from the 
analytical expression for the voltages induced by the sin- 
gle s-t pair. The equivalent resistance between any two 
nodes is a topological measure that gives further informa- 
tion about the structure of any graph. It is shown here 
how it constitutes a highly useful tool to reduce the com- 
plexity of any network structure: an arbitrary topology 
is transformed to an effective all-to-all complete graph. 

Third, using both the voltage differences and the equiv- 
alent resistance, an effective Junctional flow adjacency 
(EFFA) between any two points in the circuit is con- 
structed [Eq. ([S])] . These effective flows are the real mea- 
surable observables that are obtained when a probe is 
placed between any two nodes in the network. The EFFA 
matrix expresses the level of connectivity in terms of flow 
transmission; the functional relationship among the net- 
work components in the effective all-to-all topology. In 
particular, if two nodes are not directly connected, an 
effective current between them may still exist. 

Fourth, as an application of our approach, we show 
how to identify the nodes that are mainly responsible for 
the transmission of flow, how to detect qualitatively com- 
munities of the network, and how to infer maximal node 
outflow for different graph topologies and resistor distri- 
butions. This reveals topological issues that are related 
to node maximum capacities in a natural manner. 

The starting point for our unified approach is the cal- 
culation of voltages in a network by solving the VF prob- 
lem. The VF problem is set by interpreting the graph 
structure as a linear resistor network circuit, which then 
receives an input flow / at node s and leaves at node t 
(a current / enters the circuit at a source and leaves at 
a sink). The network structure is given by a connected 
strict graph Q = {V, £} (where V and £ are the sets 
of nodes and edges, respectively) with symmetric edge 
weights Wij = Aij/Rij for i, j = 1, . . . , iV (iV being the 
number of nodes in V, Aij the ij-th element of the adja- 
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cency matrix, and Rij the edge's resistance). 

Imposing conservation of charge in every node of the 
network, i.e., a graph where at each node the current 
arriving at it equals the amount leaving it (first Kirch- 
hoff's law), the flow vector is {F^''*^)i = I {6is — Su) for 
i ~ 1, . . . , N, with 5ij being the Kronecker delta. Thus, 
using the laws of Kirchhoff and Ohm, the net current in 
any node is given by 



N 



A, 



Ri 



(1) 
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where is the voltage potential at node i given the 

particular s-t pair. Equation (IT|) is rearranged such that 
the VF problem is expressed in matrix form 



(2) 



where the upper-indexes indicate that the problem de- 
pends on the location of the s-t nodes on V and G is the 
weighted Laplacian matrix. The entries of G are 



if 



(7 . - / T,k=lW^k 

I -w., if 



(3) 



Then, the VF solution is achieved once the voltages in 
each node are found from inverting G. However, the 
rank of G is - 1 (det (G 



inversion is not possible jlll [1 



^=0^ A„ = 0) and direct 



The Laplacian inversion is addressed here by a different 
interpretation of the derivation of the generalized inverse 
matrix, also known as Moore- Penrose matrix [ll. 13|. 

The eigenvalue problem for the Laplacian is given by 
G P = P A, where Ay = Sij A„ (71 = i - 1 = 0, . . . , - 
1) is a diagonal matrix containing all eigenvalues and 
P = {^Oj ^^ij • ■ • 5 vn-i} is the matrix whose columns are 
the eigenvectors of G. We prove here that to express 
any element of G, the only elements needed from the 
spectral decomposition are the non-null eigenvalues and 
corresponding eigenvectors. This results in the following 
decomposition of G 



P. A. Pi 



(4) 



where T indicates transpose, P^ £ G 
-^{N-r) X {N-r) ^ g^^^ ^ bciug the number of connected com- 
ponents of Q, which in this Letter is fixed at r = 1. 

Equation Q is derived by analyzing each Laplacian 
matrix entry using the full spectral decomposition. In 



particular, dj = Ylk=i F'ik>^k-iPkj\ where for fc = 1, 
Ao = 0, so no contribution is added to the sum. Thus, 
observing that the inverse of an orthogonal matrix is its 
transpose (P^"^.^ = Pjk = {vk-i)^), the ij's entry of G 

is given by: dj = Y.k=2 (^fc-i)i ^k-i {vk-i)j, which is 
easily extendible to various connected components mak- 
ing the lower bound for the summation index k = r -\- 1. 



However, removing the null eigenspace of the spanning 
set of eigenvectors, the rank of the new G in Eq. (|4]) is iV 
and inversion can be directly performed. Consequently, 
each element of the generalized inverse Laplacian matrix 



(G ^)ij is found from 

N-l 
Zij = ^ {Vn 



Xr 



(5) 



with the new index n = k — 1 and i, j = 1, . . . , N . It is 
easy to show that X^I^i Z^j = Sij — ^/N, where the 
extra constant factor comes from the incompleteness of 
the restricted eigenspace and it cancels out when calcu- 
lating voltage differences. 

Returning to Eq. ([2|), the solution for the VF problem 

at a given node i g V is I^T/t**)^ 
I {Zis — Zit), 



Z]j=i Zij 



N-l 



(6) 



This formula is the backbone of our unified approach. 
It is applicable to any connected weighted graph and is 
extendible to many sources and sinks with different in- 
puts and outputs as long as flow conservation is fulfilled 
(E» Pt*' = and = Vi 7^ s or i). Furthermore, it 
allows to derive the equation for the equivalent resistance 
between any two nodes, to construct the effective flows, 
and later apply them to, e.g., community detection. 

The derivation of the equivalent resistance pij between 
any two nodes [ij is done as follows. Set the source at 
a starting node (i — s) and the sink at an ending point 
(j — t), then Eq. ([6|) provides an exact closed formula 
for Pij in terms of eigenvalues and eigenvectors of the 
weighted Laplacian matrix. The reason for doing this is 
that the voltage difference between these two nodes gives 
the incoming current times the resistance between them, 
hence, (y^'**))^ - {V^'*^)t = I Pst- Therefore, using the 
same procedure for every pair of nodes in the graph all 
Py's are determined. 



(7) 



providing a solution that is independent of where the s-t 
pair is placed. 

The matrix element pij is a topological measure that 
weighs all paths between any pair of nodes i and j. All 
paths between these nodes that are allowed by the adja- 
cency structure of the graph are collapsed to one equiv- 
alent link with weight pij. The result is that a single 
effective link is created between i and j. The important 
contribution is that this quantity presents the possibility 
to diminish the complexity of any graph to a simpler all- 
to-all equivalent topology with a betweenness score given 
by Pij. 
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As a working example, results of applying the solution 
given by Eqs. (|6]) and (0 to a ring graph, namely Cat, of 
= 16 nodes with resistors equal to one = 1 Vi, j) 
and input current 1=1 are shown in Fig. [T] For this 
network, the set of unordered eigenvalues is given by 



12|. 



A„ = 2 - 2cos(27r7i/iV), with n = 0,...,N ~1 

The top left panel in Fig. [1] shows that for the N {N — 
1) = 240 possibilities of s-t pairs in Cjv the maximum 
voltages achieved in the ring do not surpass 2 (arbitrary 
units). This is the maximum difference that the eigenvec- 
tor coordinates can achieve in the non-normalized frame 
(top right panel in Fig.[T]), hence giving a maximum volt- 
age difference of 4. This result shows that not only the 
eigenvalues of the Laplacian are important to infer the 
network's functional behaviour, but eigenvectors are also 
needed. Also, the maximum (minimum) is periodic, in 
the sense that the largest difference happens every time 
the source and sink are separated by 8 edges. 



-0.5 




FIG. 1: The top left panel has the maximum (black) and 
minimum (red) voltages that are achieved in an unweighed 
{Rij = 1 ring graph (Cjv) with A'^ = 16 nodes for every 

s-t pair in units of / = 1 for the input current at node s. The 
top right panel shows the eigenvectors (y/Nvn) coordinate 
values (color scale) of the corresponding weighed Laplacian 
matrix. The bottom left panel exhibits in color coded, for all 
these possible input-output configurations, the voltages V"/"'' 
of every node. The bottom right panel shows the equivalent 
resistor matrix (pij) for every pair of nodes in Cjv. 

The bottom panels in Fig. [1] show the voltage solu- 
tions for all the s-t pairs (left) and all equivalent resis- 
tances (right). The resulting values show the functional 
and structural symmetry that this simple network has. 
As all edge weights are unity {Rij = 1), the equivalent 
resistance pij maximum can only be 4. This is easily seen 
when calculating it directly by means of series and par- 
allel resistor calculations. For instance, for nodes i = 1 
and j = 9 there are two parallel paths with 8 edges in 
series, hence: p|j~g = | + | => pi^g = 4, which is the 
same value as element pi.g in the bottom right panel of 



Fig. [T] (found from Eq. (O). 

Having the voltage difference and equivalent resistance 
given by Eqs. ([6|) and ([T]), the third result of our unified 
approach is addressed: the construction of the effective 

(st) 

edge flows, namely, . We introduce these flows by 
finding the voltage differences among any two nodes in 
the effective topology given by pij. Thus, the EFFA ma- 
trix elements for each s-t configuration arc given by 



1 

Pij 



- V 



(8) 



where (F^"*)), > (y(**))j such that an effective current is 

( st) 

fiowing from node i to j, otherwise = 0. Hence, the 
s-t dependent EFFA matrix defines an effective directed 
network; which breaks the symmetry of the effective all- 
to-all topology structure given by pij. In particular, the 
physical current on each edge ij £ £ of the graph is given 



by/, 



(st) 



The voltage differences define a functional relationship 
among nodes and the equivalent resistance an effective 
topology structure. The introduction of the EFFA relates 
both structure and functional characteristics of the graph, 
thus, it enables to find functional hubs that might not 
be there in the topological structure, and detect highly 
connected areas (communities) in a functional way. In 
this sense, the s-t analysis acts similarly as how a dye 
does when introduced to a circulatory system of a person: 
it highlights a certain part of the network structure to 
detect what is being sought, e.g., clog blocking, arteries 
stiffness, etc. 




FIG. 2: The left panel shows the effective flow functional ad- 
jacency (EFFA) matrix for the unweighed Cjv graph of Fig. [T] 



where all source-sink pairs have been averaged, i.e., 



The 



right panel exhibits the same EFFA matrix for a particular 
source-sink pair, namely, s = 2 and t = 10 (<^'^'^'''). Each 
matrix element's magnitude is associated with a color scale. 

The mean EFFA matrix is defined to detect statisti- 
cally relevant functional observables. It is calculated by 
all the effective currents (Eq. ([8])) among the different 
pairs averaged over all possible realizations of source-sink 
pairs. Its matrix elements are 



■13 



(st) 



N 

E 



N 



{st) 
ij 



1) 



(9) 
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which define a matrix that constitutes an effective struc- 
tural flow quantity and is related to the equivalent resis- 
tance matrix. The EFFA matrix and its averaged coun- 
terpart for the example of the ring graph are shown in 
Fig. [21 where the left panel shows the mean EFFA matrix 
and the right panel exhibits the symmetry breaking for 
a particular s-t pair. 

With the three quantities analytically expressed (volt- 
ages in Eq. equivalent resistances in Eq. Q, and 
the effective currents in Eq. (|5|)), we investigate [ij] ran- 
dom graphs (RN) [l^, small- world (SW) networks (lij . 
scale- free (SF) topologies [l3|, and numerically generated 
clustered networks (CN) of = 2^ nodes. The follow- 
ing conclusions constitute the application of our unified 
approach to these concrete topologies. In particular, the 
differences between the respective adjacency and effec- 
tive topologies (equivalent resistance) are shown in the 
left and centre columns of Figs. [3] and SI 

To study other features of the EFFA matrix that are 
not represented in its averaged version and to answer how 
the flow is effectively distributed in any given topology, 
the probability density function (PDF) of the effective 
edge flows for every particular s-t is calculated. This 
does not only tells how diffusive or localized the transport 
of currents is as a function of the effective topology, but 
also gives the maximum flow values a particular graph 
can have, i.e., the maximum edge capacity. 

Results show that the EFFA PDFs for RN, SW, and 
SF networks are roughly invariant under changes of the 
s-t pair. However, SW and SF networks produce more 
high flow magnitudes ((^.-f ^ — 1) than RN for a broad 
range of parameters [iJ], hence, RNs distribute flows in 
a more uniform way than SF or SW networks. These 
observations arc seen in the example shown on the right 
column of Fig. [31 

The detection of communities is done by analysis of 
the mean EFFA matrix and the equivalent resistance. In 
general, the application of the EFFA analysis on numer- 
ically generated CNs qualitatively exhibits the communi- 
ties that the networks have. This is seen in the centre and 
right panels of the example exhibited in Fig.[4l where the 
communities are spotted as dark areas, both in pij and 
4>ij, though the contrast in the mean EFFA matrix is 
higher than in the equivalent resistance, implying that 
communities are better identified by (f)ij . The matrix 
not only detects the communities but also shows that the 
effective flows between communities are larger than the 
ones within each community. Hence, the intra-edges con- 
necting communities must have a high flow capacity as 
they represent the vulnerable links in the network. Re- 
moval of any of these edges generate drastic changes in 
the flows, leading to the isolation of communities. 

The observations on flow transmission and community 
detection have been previously shown by doing structural 
analysis of networks, e.g., in Refs. \m\M- ^^-^ particular. 




FIG. 3: The figure shows the following unweighted graphs: 
a random (top row), a scale-free (centre row), and a small- 
world (bottom row) network of A*' = 2^ nodes and connecting 
probability p = 0.05 [ij- The adjacency matrix (left column), 
equivalent resistance matrix (centre column in color coded), 
and effective functional flow adjacency PDFs (right column) 
for all s-t configurations (gray scale) are shown for each graph. 
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FIG. 4: The figure shows the adjacency matrix (left column), 
equivalent resistance (middle column), and the average effec- 
tive functional adjacency matrix (right column) for an un- 
weighed clustered network formed by 4 random graphs of 
A'c = 2^ [Hi. The network has a total of iV = 2^ nodes 
and each matrix element magnitude is associated with a color 
scale. 

the possibility of using voltages as a betweenness mea- 
sure score is pointed out in Refs. [ll-0| and is related to 
shortest-paths and random walks. With our unified ap- 
proach, and the four results that are addressed in this 
Letter, we show the practicality of it. Moreover, the lin- 
ear relationship between loads and flows, and conserva- 
tion of charge, are restrictions needed to define the flow 
network on top of the structure but not for the structure 
itself. Thus, the flow analysis is applicable to any net- 
work structure outside the domain of flow networks as 
long as the graph to be analysed is connected. 

An estimate of the computational time that our ap- 
proach requires is as follows. To calculate the voltages 
one needs the eigenvectors and eigenvalues of the Lapla- 
cian matrix. This is calculated in time 0{N"-), with a 
being an exponent that nowadays rounds 2 and N the 
number of nodes in the network. Then, if all possible 
source-sink pairs are analysed, the flows require 0{N^) 
to be found. This means that the algorithm is not as 
effective as other methods llSf but its mathematical for- 
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mulas still provide great information for obtaining upper 
and lower bounds for currents without the need for simu- 
lations. Furthermore, community detection can be quan- 
titatively described if other techniques are used, such as 
PDF analysis of the EFFA matrix elements. 
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